|
|
|
| Home | Wayback Machine | Archive-It | Blog | Heritrix |
| Anonymous User (login or join us) |
Internet Archive crawldata from Aaron Swartz Crawl, captured by crawl345.us.archive.org:aaronswartz from Sat Jan 12 20:53:44 PST 2013 to Sat Jan 12 13:20:30 PST 2013.
This item is part of the collection: Away from Keyboard: Aaron H. Swartz
Identifier: AS-20130112205344-crawl345
Contributor: Internet Archive
Crawljob: aaronswartz
Creator: Internet Archive
Date: 2013
Firstfiledate: 20130112205430
Firstfileserial: 00010
Identifier-access: http://www.archive.org/details/AS-20130112205344-crawl345
Lastdate: 20130112132030
Lastfiledate: 20130112211830
Lastfileserial: 00019
Mediatype: web
Numwarcs: 10
Operator: lekash@archive.org
Scandate: 20130112205430
Scanner: crawl345.us.archive.org
Scanningcenter: sanfrancisco
Sizehint: 10259511570
Sponsor: Internet Archive
Publicdate: 2013-01-13 03:02:46
Addeddate: 2013-01-13 03:02:46
Imagecount: 277254
Keywords: crawldata
| Information | Format | Size |
| AS-20130112205344-crawl345_files.xml | Metadata | [file] |
| AS-20130112205344-crawl345_meta.xml | Metadata | 1.4 KB |
| Other Files | Web ARChive GZ | WARC CDX Index | Item CDX Index | Item CDX Meta-Index | Text |
| AS-20130112205344-00010.warc.gz |
955.0 MB
|
2.2 MB
|
|||
| AS-20130112205344-crawl345.cdx.gz |
15.1 MB
|
||||
| AS-20130112205344-crawl345.cdx.idx |
9.2 KB
|
||||
| AS-20130112205705-00011.warc.gz |
955.9 MB
|
2.0 MB
|
|||
| AS-20130112210011-00012.warc.gz |
971.4 MB
|
1.3 MB
|
|||
| AS-20130112210233-00013.warc.gz |
967.4 MB
|
1.4 MB
|
|||
| AS-20130112210455-00014.warc.gz |
953.8 MB
|
1.3 MB
|
|||
| AS-20130112210723-00015.warc.gz |
953.7 MB
|
1.5 MB
|
|||
| AS-20130112210952-00016.warc.gz |
953.7 MB
|
2.1 MB
|
|||
| AS-20130112211244-00017.warc.gz |
1.1 GB
|
1.9 MB
|
|||
| AS-20130112211546-00018.warc.gz |
990.2 MB
|
1.6 MB
|
|||
| AS-20130112211828-00019.warc.gz |
967.6 MB
|
492.7 KB
|
|||
| MANIFEST.txt |
660.0 B
|